An investigation into the population abundance distribution of mRNAs, proteins, and metabolites in biological systems

Authors

  • Chuan Lu
  • Ross D. King
Abstract

MOTIVATION: Distribution analysis is one of the most basic forms of statistical analysis. Thanks to improved analytical methods, accurate and extensive quantitative measurements can now be made of the mRNAs, proteins and metabolites in biological systems. Here, we report a large-scale analysis of the population abundance distributions of the transcriptomes, proteomes and metabolomes from varied biological systems.

RESULTS: We compared the observed empirical distributions with a number of candidate distributions: power law, lognormal, loglogistic, loggamma, right Pareto-lognormal (PLN) and double PLN (dPLN). The best fit for mRNA, protein and metabolite population abundance distributions was found to be the dPLN. This distribution behaves like a lognormal distribution around the centre, and like a power law distribution in the tails. To better understand the cause of this observed distribution, we explored a simple stochastic model based on geometric Brownian motion. The distribution indicates that multiplicative effects are causally dominant in biological systems. We speculate that these effects arise from chemical reactions: the central-limit theorem then explains the central lognormal, and a number of possible mechanisms could explain the long tails, such as positive feedback and network topology. Many of the components in the central lognormal parts of the empirical distributions are unidentified and/or have unknown function. This indicates that much more biology awaits discovery.
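The abstract does not give the details of the stochastic model, so the following is only an illustrative sketch of how geometric Brownian motion can produce a lognormal-like centre with heavier tails. It assumes, purely for illustration, that each component starts from a lognormal abundance and is observed after an exponentially distributed time; this assumption, the function names and all parameter values (`mu`, `sigma`, `rate`, the initial spread) are hypothetical and are not taken from the paper.

```python
import numpy as np


def simulate_gbm_abundances(n, mu=0.0, sigma=1.0, rate=1.0, seed=0):
    """Sample abundances from geometric Brownian motion observed at random times.

    Assumption (not from the abstract): observation times are exponential
    with the given rate, and initial abundances are lognormal.
    """
    rng = np.random.default_rng(seed)
    # Random observation time for each component.
    t = rng.exponential(scale=1.0 / rate, size=n)
    # Lognormal starting abundances (normal on the log scale).
    log_x0 = rng.normal(loc=0.0, scale=0.5, size=n)
    # Exact GBM solution on the log scale:
    # log X_t = log X_0 + (mu - sigma^2 / 2) * t + sigma * sqrt(t) * Z
    z = rng.standard_normal(n)
    log_x = log_x0 + (mu - 0.5 * sigma**2) * t + sigma * np.sqrt(t) * z
    return np.exp(log_x)


def excess_kurtosis(values):
    """Excess kurtosis; approximately 0 for normally distributed values."""
    v = np.asarray(values, dtype=float)
    centred = v - v.mean()
    return float(np.mean(centred**4) / np.mean(centred**2) ** 2 - 3.0)


abundances = simulate_gbm_abundances(200_000)
log_abundances = np.log(abundances)

# A pure lognormal would give log-abundances with excess kurtosis near 0;
# a clearly positive value signals heavier-than-lognormal tails.
print("excess kurtosis of log-abundances:", excess_kurtosis(log_abundances))
```

Under these assumptions the log-abundances are a normal variable plus a skewed, Laplace-like term, which is the kind of shape the dPLN describes: roughly lognormal near the centre and power-law-like in both tails. Fitting and comparing the candidate distributions listed in the abstract would need a separate likelihood-based step that is not shown here.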




Journal:
  • Bioinformatics

Volume 25, Issue 16

Publication date: 2009